EXPERIMENTAL EVALUATION OF SEGMENTAL HMMS - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
نویسندگان
چکیده
The aim of the research described in this paper is to overcome important speech-modeling limitations of conventional hidden Markov models (HMMs), by developing a dynamic segmental HMM which models the changing pattern of speech over the duration of some phoneme-type unit. As a first step towards this goal, a static segmental HMM [3] has been implemented and tested, This model reduces the influence of the independence assumption by using two processes to model variability due to long-term factors separately from local variability that occurs within a segment. Experiments have demonstrated that the performance of segmental HMMs relative to conventional HMMs is dependent on the “quality” of the system in which they are embedded. On a connected-digit recognition task for example, static segmental HMMs outperformed conventional HMMs for triphone systems but not for a vocabulary-independent monophone system. It is concluded that static segmental HMMs improve performance, as long as the system is such that the independence assumption is a major limiting factor.
منابع مشابه
Sequential homotopy-based computation of multiple solutions to nonlinear equations
IEEE Intl. Conf. Acoustics, Speech & Signal Processing (ICASSP) May 1995 Homotopy methods have achieved significant success in solving systems of nonlinear equations for which the number of solutions are known and the homotopy paths are bounded. We present a twostage homotopyprocess which does not require a-priori knowledge of the number of solutions to a system of nonlinear equations. This app...
متن کاملCOMPARISON OF A NEW HYBRID CONNECTIONIST-SCHMM APPROACH WITH OTHER HYBRID APPROACHES FOR SPEECH RECO - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
This paper compares a newly proposed hybrid connectionist-SCHMM approach [5] with other hybrid a p proaches. In the new approach a multilayer perceptron (MLP) replaces the conventional codebooks of semicontinuous HMMs. The MLP is therefore trained on s w d k d basic elements (phones and phone parts) in such a way that the outputs of the network estimate the a posteriori probabilities of these e...
متن کاملMAXIMUM SINR BEAMFORMING FOR CORRELATED SOURCES - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
AB-STRACT In this paper, we consider the signal-to-interference plus noise ratio (SINR) performance of several beamforming algorithms, taking particular account of the contribution of sources correlated with the desired signal. In addition, we derive an optimal method that maximizes SINR by combining with the desired signal estimate any components of the interference/multipaths that are correla...
متن کاملDSP-BASED MOBILE AND SATELLITE RECEIVERS, FROM ALGORITHM TO IMPLEIMENTATION: A DESIGN COURSE AT AACH - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Profound knowledge of the interaction between algorithms and digital signal processor (DSP) architectures is required to be able to efficiently design complex communications equipment. Whereas both algorithms and architecture find treatment in many courses individually, education focusing on design methodology for DSP implementation is found to be rare. This contribution describes a concept and...
متن کاملDIGITAL IMAGE HALFTONING BY NOISE THRESHOLDING - Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Approaches for digital halftoning of images using dithering threshold the input image with additive dithering noise. The paper presents a technique which thresholds the noise directly. The threshold is modified at each step such that the expected value of the output is equal to the input pixel’s gray value. Further, error feedback is used to correct the threshold. Tests on images show the metho...
متن کامل